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■ Abstract 

^"*^ ' In this paper, we use geometry of numbers to relate two dual Diophantine problems. 

This allows us to focus on simultaneous approximations rather than small linear forms. 
As a consequence, we develop a new approach to the perturbation theory for quasi- 
periodic solutions dealing only with periodic approximations and avoiding classical small 
divisors estimates. We obtain two results of stability, in the spirit of the KAM and 
Nekhoroshev theorems, in the model case of a perturbation of a constant vector field on 
' the n-dimensional torus. Our first result, which is a Nekhoroshev type theorem, is the 

^ . construction of a "partial" normal form, that is a normal form with a small remainder 

whose size depends on the Diophantine properties of the vector. Then, assuming our 
vector satisfies the Bruno-Riissmann condition, we construct an "inverted" normal form, 
recovering the classical KAM theorem of Kolmogorov, Arnold and Moser for constant 
vector fields on torus. 

o _ 

1 Introduction and results 

Let a be a non-zero vector in 1". A quasi-periodic solution with frequency a is simply an 
integral curve of the constant vector field X a = a on the n-dimensional torus T" = K n /Z n . 
Constant vector fields on a torus have only trivial dynamical properties, but this is not the case 
for their analytic perturbations, which are of great interest both in theoretical and practical 
aspects in celestial mechanics and Hamiltonian systems. 

First let us recall the dynamical properties of a constant vector field X a , which depends 
on the "Diophantine" properties of a (note that these properties depend on a only through 
its equivalence class under the action of PGL(n,Z)). If the components of the vector a are 
independent over the field of rational numbers Q, then X a is minimal, that is every orbit is 
dense on T n , and moreover X a is uniquely ergodic, that is every orbit is equidistributed with 
respect to the Haar measure. If the components of the vector a are dependent over Q, and if 
m is the number of independent rational relations, then there is an invariant foliation on T n , 
whose leaves are diffeomorphic to T n ~ m and whose leaf space is diffeomorphic to T m , and the 
restriction of X a to each leaf is minimal and uniquely ergodic. Here we will not always make 
the assumption that m = 0, and we will write d = n — m. 
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Adding a small analytic perturbation P to our vector field X a , a simple description of the 
orbits of X a + P is in general no longer available, and since Poincare it is customary to try to 
conjugate the perturbed vector field to a "simpler" one, which is usually called a normal form. 
In our context, the best one can hope for is to conjugate the perturbed vector field to a vector 
field of the form X a + N, where the vector field ./V commutes with X a . To construct such a 
conjugacy to a normal form, one usually has to integrate along the integral curves of X a and 
difficulties arise as these curves are in general non-compact: more precisely, values of linear 
forms k ■ a, k € Z n , which might be very small for integers k with large norms, appear in the 
denominator of the function generating the conjugacy. These are usually called small divisors, 
and they show that more quantitative information on the values of \k ■ a\, k € Z n , that is on 
the Diophantine properties of a, is needed in order to carry on a perturbation theory. By 
Dirichlet's box principle, when m = and hence d = n, for a given Q > 1, one can always 
find a non-zero integer k with \k\ < Q for which \k ■ q[ _1 > \a\~ 1 Q n ~ 1 . In the general case 
where d is smaller than n, adding further multiplicative constants depending on a, Q n ~ l can 
be replaced by Q d ~ l . This gives a first way of quantifying these Diophantine properties, by 
introducing a real- valued function depending on a, that we shall call here = &' a , which has 
to be non-decreasing, piecewise constant, unbounded and grows at least as Q d ~ l (see §2.1 for 
a precise definition; this function is denoted by ^ there and is part of a family of functions 
^>' k for 1 < k < d). Assuming that this function grows at most as a power in Q (which 
was a condition introduced by Siegel in a related problem, and is known as a Diophantine 
condition) , this problem of small divisors was solved by Kolmogorov ( [Kol53] , [Kol54] ) , who 
introduced a new fast-converging iteration method similar to a Newton iteration method. 

Yet there is one special situation in which these difficulties do not appear, namely in the 
case m = n — 1, hence d = 1, where the integral curves are all periodic, and hence compact. 
For such a vector, that we shall denote here u to distinguish with the general a, this means 
that there exists a real number T > such that Tuj € Z n , and assuming that T is minimal 
with this property, T is the minimal common period of all the integral curves. By extension, 
such a vector oj is called periodic, and it is characterized by the fact the (n — l)-vector defined 
by the ratios of its components (or equivalently, its image in the projective space) is rational. 
The perturbation theory in this case is much simpler, it was already known to Poincare and 
a classical Picard iteration method is here sufficient. By Dirichlet's box principle, it is always 
possible to approximate a non-zero vector a by a periodic vector u as follows: when m = 
and hence d = n, for a given Q > 1, one can always find a periodic vector uj, with period T, 
such that \Ta — Tui\ < Q~ l and (a)" 1 < T < lal -1 ^™ -1 . As before, in the general case where 
d is smaller than n, one obtains Q d_1 instead of Q™ -1 but with other constants depending on 
a. However, such a periodic approximation does not give us enough information in general. 

It is in fact possible to find not just one but d independent periodic approximations in 
the following sense: for any Q > 1, there exists independent periodic vectors u±, . . . , LOd-, with 
periods Ti, . . . ,T^, such that \Tja — TjCOj\ < Q^ 1 for 1 < j '• < d. This last inequality will be 
expressed by saying that each coj is a Q-approximation of a. Of course, in general the upper 
bound on the periods Tj, which is at least of order Q^ 1 , can now be much larger, but we can 
introduce yet another real- valued function, that we call ^ = ^> a , which also has to be non- 
decreasing, piecewise constant, unbounded and grows at least as Q d ~ l and which gives the 
growth of the upper bound on these periods in terms of Q (once again, see §2.1 for a precise 
definition where this function is denoted by and is also part of a family of functions ^ for 
1 < k < d, see also Proposition 2.3 in §2.2 for the result). This is another characterization of 
the Diophantine properties of a, and our main Diophantine result, Theorem 2.1, will imply 
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(as a particular case) that this characterization is equivalent to the previous one, in the sense 
that the functions VP and are the same up to constants depending on d and a (Theorem 2.1 
states that the functions and W d+1 _ k are the same up to constants depending on d and 
q, for any 1 < k < d). This will be proved in §2, by using arguments from the geometry 
of numbers: more precisely, our result will be a consequence of the relation between the 
successive minima of a convex body with respect to a given lattice and the successive minima 
of the polar convex body with respect to the polar lattice. 

Then, in §3, we shall develop a new approach to the perturbation theory for quasi-periodic 
solutions based uniquely on periodic approximations. Roughly speaking, from an analytical 
point of view, we will reduce the general quasi-periodic case 1 < d < n to the periodic case 
d = 1. One could probably say that perturbation theory involves some arithmetics, some 
analysis and some geometry. Since we have nothing new to offer as far as the geometry is 
concerned, we shall restrict to a situation where it simply does not enter into the picture, 
and this is the case if the frequency of the solution is fixed. This is why we will consider 
perturbation of constant vector fields on the torus (we could have also considered perturbation 
of linear integrable Hamiltonian systems, this is quite similar). In this context, we will apply 
our method and prove two results in the spirit of the KAM and Nekhoroshev theorems. Our 
first result, Theorem 3.1, which is valid for any 1 < d < n and any a € W 1 \ {0}, is the 
existence of an analytic conjugacy to a "partial" normal form, that is we will construct an 
analytic conjugacy between X a + P and a vector field X a + N + R, where X a + N is a normal 
form and R a "small" remainder. The smallness of this remainder depends precisely on the 
Diophantine properties of a, and if the latter satisfies a classical Diophantine condition, the 
remainder is exponentially small (up to an exponent) with respect to the inverse of the size of 
the perturbation. This statement is analogous to the Nekhoroshev theorem for perturbation 
of linear integrable Hamiltonian systems. Our second result, Theorem 3.3, which is valid only 
for d = n and for a G R n \ {0} satisfying the Bruno-Russmann condition, is the existence of 
an analytic conjugacy between a modified perturbed vector field X a + Xp + P, where Xp = f3 
is another constant vector field that depends on P, and the unperturbed vector field X a . 
Note that for d = n, constant vector fields are exactly vector fields N for which [N, X a ] = 0, 
hence the theorem states the existence of a vector field N which commutes with X a such that 
X a + N + P is analytically conjugated to X a , and therefore this can be called an "inverted" 
normal form. This statement is exactly the classical KAM theorem for vector fields on the 
torus of Kolmogorov, Arnold and Moser. 

To conclude, let us point out that the idea of introducing periodic approximations in per- 
turbation theory is due to Lochak. In [Loc92], using only Dirichlet's box principle, Lochak 
gave drastic improvements, both from a qualitative and a quantitative point of view, on 
Nekhoroshev's stability estimates for a perturbation of a convex integrable Hamiltonian sys- 
tem. Lochak's argument strongly relies on convexity, and this assumption enables him to use 
just one periodic approximation to derive an essentially "optimal" result. However, in the 
general case, one periodic approximation seems to be not enough. The idea has been then 
taken up by Niederman in [Nie07], and further improved in [BN11], where linearly indepen- 
dent periodic approximations were used to prove Nekhoroshev's estimates for a much larger 
class of integrable Hamiltonian systems. However, the arguments there were based only on 
successive applications of Dirichlet's box principle, and unlike [Loc92], the results obtained 
were far from being "optimal" (this also stems from the fact that Nekhoroshev's estimates 
are much more complicated to derive without the convexity assumption). At last, we should 
also point out that using a multi-dimensional continued fraction algorithm due to Lagarias, 
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Khanin, Lopes Dias and Marklof gave a proof of the KAM theorem with techniques closer 
to renormalization theory (see [KLDM06], [KLDM07]). Even though their arguments deal 
with small divisors yet in a different way, they were still based on Fourier expansions and 
elimination of Fourier modes, which is quite different from the techniques we will use in this 
paper, since Fourier analysis will not be involved at all. 

2 A Diophantine duality 

We gather in this section all the Diophantine part of this text. In §2.1 we define the functions 
^>k and ^>' k mentioned in the Introduction, and we state our main result: ^ and ^' d+1 _ k 
are essentially equal. We don't claim this result to be new, but we did not find it in the 
literature so we provide a complete proof based on a classical duality result of geometry of 
numbers. In the special case k = d (which is the one used in the rest of this paper), this 
Diophantine duality is related to inhomogeneous approximation (see [Cas57], Chapter V, §9), 
in particular to transference results due to Khintchine (see Theorem VI in Chapter XI, §3 of 
[Cas59], Theorem XVII in Chapter V, §8 of [Cas57], and also Lemma 3 of [BL05] and Lemma 
4.1 of [Wal], p. 143). We would like to point out the fact that these transference results, 
between homogeneous and inhomogeneous Diophantine approximations, are distinct from the 
well-known "Khintchine's transference principle" (stated as Theorem 5A, p. 95 of [Sch80]) 
which connects the existence of solutions in dual homogeneous Diophantine approximation 
problems. By the way we believe that this transference principle of Khintchine is in general 
not good enough for the applications to the KAM and Nekhoroshev theorems (except in the 
very special case where the vector satisfies a classical Diophantine condition with the best 
exponent). 

In §2.2 we derive some corollaries and state other useful Diophantine properties (some of 
them related to Diophantine vectors and the Bruno- Riissmann condition). In this way, all 
Diophantine properties used in our application to the perturbation theory for quasi-periodic 
solutions are stated in §2.2. 

We recall in §2.3 the definition and classical properties of the successive minima of a 
convex body with respect to a lattice, including the main result from geometry of numbers 
we rely on (namely (2.7)). Using these tools we prove in §2.4 the results stated in §§2.1 or 
2.2, except for Proposition 2.2 which is proved in §2.5. At last, we focus in §2.6 on the case 
of one number, in which the continued fraction expansion enables one to compute ^ and ^' k 
explicitly. 

2.1 The general result 

Let n > 2, and a = (cto, • • • , a n -i) G M n \ {0}. Without loss of generality, we assume that 
\a\ = \cxo\, where \a\ = maxo<i< n _i \oti\. 

Let us say that a vector subspace of W 1 is rational if it possesses a basis of vectors in 
Q n (sec for instance [Bou62], §8), and let F = F a be the smallest rational subspace of W 1 
containing a. The dimension d of F, which is also the rank of the Z-module Z n n F, is called 
the number of effective frequencies of a. Of course, F = W 1 if and only if ao, • • • , & n -\ are 
linearly independent over Q. 

Definition 1. For k € {1, . . . , d} and a real number Q > 1, we let ^fk(Q) denote the infimum 
of the set of K > such that there exist k linearly independent vectors iGZTlF satisfying 
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the inequalities 



x\ < K, 

Xq(X — aox\ < Q 



-i 



(2.1) 



For any Q > 1, ^k(Q) exists because it is the infimum of a non-empty set, that is for 
Q > 1 and K > sufficiently large, the set of fc linearly independent vectors x € Z ra n F 
satisfying (2.1) is non-empty; we will prove this in §2.4 (within the proof of Theorem 2.1), 
eventhough a direct proof would be possible. This set being also finite, the infimum in the 
above definition is in fact a minimum. In §3, we will use ^f^ only when k = d, which amounts 
to finding a basis of F consisting in vectors of Z n such that (2.1) hold. On the other hand, 
when k = 1 the problem is to find a non-zero vector in Z n Pl.F satisfying these inequalities. We 
will prove in §2.2, Proposition 2.3, that for Q sufficiently large, each non-zero integer vector 
satisfying (2.1) gives a periodic vector cu, with a period T bounded (up to a constant) by K, 
which is essentially a Q-approximation of a in the sense that \Ta — Tuj\ is bounded (up to a 
constant) by Hence when k = d, we will find d linearly independent periodic vectors, 

with periods essentially bounded by ^d(Q)-> and which are essentially Q-approximations of 
a. Of course, ^k(Q) depends on a, but for simplicity, we shall forget this dependence in 
the notation (except in §3, where the function ^ will be denoted by ^ a ). This function Vl'fc 
is central in our approach; it is non-decreasing, piecewise constant, and left-continuous (see 
Proposition 2.2 in §2.2 below). 

Definition 1 can be easily stated in terms of successive minima (see §2.3 below). A variant 
of this definition would be to ask for a family of k linearly independent vectors which can be 
completed to form a basis of the Z-module Z n n F (see Proposition 2.3 in §2.2 below); this 
would induce only minor changes. 

The classical properties of successive minima (recalled in §2.3) will allow us in §2.4 to 
prove that ^fe(Q) exists, that it is a positive real number for any Q > 1, and to relate the 
function Vt^ to the "dual" one ^f' k we define now. 

Definition 2. For k € {1, . . . , d} let C' k denote the least integer Q > 1 for which there exist k 
linearly independent vectors x € Z, n Pi F such that \x\ < Q. For any real number Q > C' k , we 
let ^>' k (Q) denote the supremum of the set of K > such that there exist k linearly independent 
vectors x £ Z n n F satisfying the inequalities 



It is clear that ty' k (Q) exists for any Q >C k because there are only finitely many non-zero 
vectors x G IT 1 PI F with \x\ < Q, and they satisfy x ■ a ^ 0. From this it is also clear that 
the supremum in the above definition is in fact a maximum. The constant C' k will not be 
very important in this paper, because we are specially interested in ^' k (Q) when Q tends 
to infinity. Anyway, if «0; ■ • • , «n-i are linearly independent over Q then C' k = 1 because 
F = R n . 

The most interesting case is k = 1, which amounts to finding a non-zero vector in Z n n F 
satisfying (2.2). Of course, ^' k (Q) depends on a, but as before, we shall not take into account 
this dependence in the notation. 

We will prove in §2.4 that the functions ^>k and ^' d+1 _ k are equal up to constants that 
will be irrelevant in the applications. The precise result is the following (in which we have 
not tried to optimize the constants neither the dependence in the lattice U l PI F) . 




(2.2) 
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Theorem 2.1. Let Qo = max(l, (n + 2)C^|ck| 1 ), and 

c\ = (n + l) _1 |o[, C2 = (n + 2) _1 |a|, C3 = (det A) 



2 c?!|q|, C4 = 2C3. 



If Q > Qo, then we have the inequalities 



Cl V d+1 _ k (c 2 Q) < ¥ fc (Q) < c 3 ^ +1 _ fc (c 4 Q). 



As the proof shows, it is possible to replace c 2 with any real number less than c\. With 
this improvement, the first inequality becomes essentially optimal in the case n = 2 (see 
Proposition 2.7 in §2.6 below, and the remark following it). On the opposite, the constants 
C3 and C4 can be improved (for instance using refined versions of (2.7)), but we did not try 
to go any further in this direction because the values of these constants are not relevant in 
the applications we have in mind. 

2.2 Other Diophantine results 

We gather in this section some corollaries of Theorem 2.1, and other Diophantine results 
useful in this paper. To begin with, let us state some properties of the functions ty k and ^' k . 

Proposition 2.2. Let k € {1, . . . , d}. The functions and Vl/1 are non- decreasing, piecewise 
constant, and have limit +00 as Q — > +00. More precisely, there exist increasing sequences 
(Qe)e>i an d (Q'e)e>i, with limit +00, such that: 

• is constant on each interval ]Qg,Qi + i] and on [l,Qi], with Qi > 1; 

• ty' k is constant on each interval [Q'e,Qg + i[, with Q\ = C' k ; 

• The values ^k(Qe) and the arguments Q' e are integers. 

In particular, ^ k is left- continuous and W k is right- continuous. 

It is not difficult to prove this result directly; however we will deduce it in §2.5 from 
an interpretation of ^>k and ^ k in terms of sequences of minimal families, in the spirit of 
Davenport and Schmidt. 

Then, let us recast the definition of the function in terms of approximations of a by 
periodic vectors. Recall that a vector w GR"\ {0} is T-periodic, T > 0, if Toj € IT 1 . 

Proposition 2.3. For any Q > max(l, ci|a| -1 ) ; there exists d periodic vectors lo±, . . . ,ujd, of 
periods T±, . . . , T^, such that T\uji, . . . , T^uid form a Z-basis o/Z"nF and for j € {1, . . . , d}, 



We will prove this proposition at the end of §2.4 below, after the proof of Theorem 2.1. 

Finally, we have stated Theorem 2.1 in a very general setting, since we believe that this 
generality can be useful in applications. However, in §3, we will be mainly interested in 
the special case where k = d, and Theorem 2.1 states that the functions ^ and ^[ are 
equal up to constants. Usually, in the perturbation theory of quasi-periodic solutions, the 
Diophantine properties of a are quantified by the function applying Theorem 2.1, we will 
now reformulate these properties in terms of the function 



a-LUj\< d(\a\TjQ) 



-1 



< T j < H^d^diQ). 



6 



Definition 3. Given a real number r > d — 1, a vector a belongs to T> T d if and only if there 
exist two constants Q' a > C[ and C' a > such that for all Q > Q' a , ^[(Q) < C' a Q T . 

Vectors in T> T d are called Diophantine vectors with exponent r; they are usually defined 
only when d = n. The corollary below is an immediate consequence of Theorem 2.1. 

Corollary 2.4. Given r > d — 1, a vector a belongs to T> T d if and only if there exist two 
constants Q a > 1 and C a > such that for all Q > Q a , ^d(Q) < C a Q T . 

Another class of vectors which is well-studied in connection with perturbation of analytic 
quasi-periodic solutions is the class of vectors satisfying the Bruno-Riissmann condition. 

Definition 4. A vector a belongs to Bd if and only if there exist a continuous, non- decreasing 
and unbounded function $':[!, +oo[— >■ [1, +oo[ such that & > ^[ and 



Vectors in Bd are called Bruno-Russmann vectors, and as before, they are usually defined 
only in the case d = n. This is not the original definition of Riissmann (see [RiisOl] for 
instance), but is equivalent to it (for d = n): instead of <£' > VP^, one requires that \k • a| _1 < 
<l>'(|fc|) for all k G Z™\{0}, but in view of the definition of ^[ and the fact that the supremum 
is reached in this definition, these are equivalent. 

The corollary below is also an immediate consequence of Theorem 2.1. 

Corollary 2.5. A vector a belongs to Bd if and only if there exist a continuous, non- decreasing 
and unbounded function 3> : [1, +oo[— > [1, +oo[ such that <I> > ^ and 



In the appendix, we will show in fact that one can always find a continuous, non-decreasing 
and unbounded function $ : [l,+oo[— > [l,+oo[ such that ^d(Q) < &{Q) < ^d(Q + 1) for 
Q > 1: therefore it is equivalent to ask for the integral condition with ^ instead of <£. Of 
course, the same comment applies to VD^ in the original definition. 

2.3 Successive minima 

In this section we state the definition, and several classical properties, of the successive min- 
ima of a convex body with respect to a lattice. Standard references on this topic include 
[Cas59] (Chapter VIII) and [GL87]. Classical results will be sufficient for our purposes; they 
are contained in the first edition [Lek69] of [GL87]. The interested reader will find in these ref- 
erences much more than what we present here, including proofs, attributions and refinements 
of the results we state. 

Let V be a Euclidean vector space of dimension d > 1. Let A be a lattice in V, that 
is a discrete Z-submodule of rank d. This means that A = Zei + . . . + Ze^ for some basis 
(ei, . . . , e<f) of V. Of course the most important example is 7L d if V = M. d . 

Let C C V be a convex compact subset, symmetric with respect to the origin (that is, 
—x G C for any x £ C), with positive volume. Then C contains a small open ball around the 
origin. For any A G M we let AC denote the set of all Xx with x G C. 
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Let k G {1, . . . , d}. Then the set of all positive real numbers A such that AC n A contains 
k linearly independent points is a closed interval [Afc(C, A), +oo[: the positive real number 
X k (C,A) defined in this way is called the k-th successive minimum of C with respect to A. 
In particular, X\(C, A) is the least positive real number A such that AC fl A / {0} (since C 
is compact, it is bounded so that AC fl A = {0} if A > is sufficiently small, because A is 
discrete). On the other hand, Xd(C,A) is the least A > such that AC n A contains a basis of 
V. The definition of successive minima yields easily the following properties: 

< Ai(C, A) < . . . < X k (C, A) < ... < X d (C, A), (2.3) 

X k (C,A) > X k (C,A') (2.4) 
for any 1 < k < d, if C C C and A C A', and 

A fc (/iC,A) = f i- 1 X k (C,A) (2.5) 

X k (C,fiA)=fxX k (C,A) (2.6) 

for any 1 < k < d and any /i > 0. 

A basis of V consisting of vectors of A is not always a Z-basis of A: the Z-submodule of 
A it generates may have an index > 2 in A. Therefore the definition of A,i(C, A) does not 
imply the existence of a Z-basis of A consisting in vectors of Xd(C, A)C. However it is known 
(see the remark after the corollary of Theorem VII in Chapter VIII of [Cas59]) that there is 
a Z-basis of A consisting in vectors of dXd(C, A)C (and this will be the main argument in the 
proof of Proposition 2.3, in §2.4). More generally, for any k G {1, . . . ,d} there exist linearly 
independent vectors e±, . . . , e k G kX k (C, A)C n A such that Zei + . . . + Ze k is saturated in A 
(that is, Span R (ei, . . . , e k ) D A = Zei + . . . + Ze^), so that e\, . . . , e k are the first k vectors of 
a basis of A. 

Our main tool in the proof of Theorem 2.1 is a duality result, namely Theorem VI in 
Chapter VIII of [Cas59], §5. Let C and A be as above. The polar (or dual) convex body C* of 
C is the set of all x G V such that, for any y G C, x ■ y < 1 (where • is the scalar product on V); 
see for instance Chapter IV, §3.3 of [Cas59]. Then C* is also a convex compact subset of V, 
symmetric with respect to the origin, with positive volume. The polar (or dual, or reciprocal) 
lattice A* of A is the set of all x G V such that, for any y G A, x ■ y G Z; see for instance 
Chapter I, §5 of [Cas59]. Then Theorem VI in Chapter VIII of [Cas59], §5, asserts that 

1< X k (C,A)X d+1 ^ k (C*,A*)<d\ (2.7) 

for any k G {1, . . . , d}. The constants are not important in the applications we have in mind, 
so this result means that X k (C,A) is essentially equal to l/Xd+i~ k (C*, A*). With dl 2 instead 
of dl in the upper bound (which does not matter for our purposes), it has been proved by 
Mahler [Mah39] (see also [Mah55]) after an earlier result of Riesz [Rie37]. More recent results 
show that the constant d\ can be in fact improved to Cd(l + log d) for some universal constant 
C (see [Ban96], the optimal constant is expected to be Cd). 

Let A be a lattice, and (ei, . . . , e^) be a Z-basis of A. Then the determinant of (ei, . . . , e<j) 
with respect to an orthonormal basis of F depends, in absolute value, neither on (ei, . . . , e^) 
nor on the chosen orthonormal basis. Its absolute value is the determinant of A, denoted by 
det A. In general the theory of successive minima is introduced in M. d , so that the orthonormal 
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basis can be chosen to be the canonical one. If A* denotes the dual lattice as above, then 
det A* = (det A) -1 (see Lemma 5 in Chapter I, §5 of [Cas59]). 

Even though we won't use it directly in this paper, it seems important to recall Minkowski's 
theorem 

2 d det(A) X AN 2 d det(A) 

,71 ur\ - II A fc C ' A ^ TIFT 

a!vol(C) vol(C) 

where vol(C) is the volume of C (see Chapter VIII, §1 of [Cas59]); this result is used in the 
proof of (2.7). Using (2.3) it implies 

Ai(C,A) d vol(C) < 2 d det(A). 

Accordingly, if vol(C) > 2 d det(A) then Ai(C, A) < 1 so that C n A / {0}: this consequence is 
known as Minkowski's first theorem on convex bodies. 



2.4 Proof of the Diophantine results 

In this section we relate the functions and ^>' k defined in §2.1 to successive minima, and 
apply classical results recalled in §2.3 to prove Theorem 2.1 stated in §2.1, and also that 
^k(Q) exists for any Q > 1. At the end of this section we prove also Proposition 2.3, using 
the same tools. 

With the notation of §2.1, for any Q, K > we denote by C(Q, K) the set of all x G F 
such that (2.1) hold, that is 

C(Q,K) = {x G F | \x\ < K, \x a - a x\ < Q' 1 }. 

This is a compact convex subset, symmetric with respect to the origin, in the vector space 
F equipped with the Euclidean structure induced from the canonical one on W 1 . It has a 
positive volume (see Lemma 4 in Appendix B of [Cas57]). 

We let also A = Z n n F. By definition, F has a basis consisting in vectors of Q™. 
Multiplying these vectors by a common denominator of their coordinates yields a basis of 
F consisting in vectors of Z n . Therefore the Z-module A = IT 1 n F has rank at least d; since 
it is discrete, it has rank d and it is a lattice in F. 

Now for any k G {1, . . . , d} and any Q > 1 we have 

fc (Q) = ini{K > 0, \ k (C(Q, K),A) < 1}, (2.8) 

provided it is not the infimum of an empty set (and we shall prove this below). 

To obtain an analogous property for ^' k , for any Q,K > we denote by C'(Q,K) the set 
of all x G F such that (2.2) hold, that is 

C'(Q, K) = {x G F | |x| < Q, \x-a\< K" 1 }. 

Then C'(Q,K) is also a compact convex subset, symmetric with respect to the origin, with 
positive volume; and we have for any k G {1, . . . , d} and any Q > C' k : 

%(Q) = sup{K > 0, X k (C'(Q, K), A) < 1}. (2.9) 

We will deduce Theorem 2.1 from (2.7) (see §2.3); with this aim in mind, we denote by 
A* the lattice dual to A, and by C*(Q, K) the convex body dual to C(Q, K). The main step 
is the following lemma. 
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Lemma 2.6. We have 

A C A* C (det A) ~ 2 A 
and, for any Q,K > such that 2\a\QK > 1, 

(n + lactic' (Q,K) CC*(Q,K) c C {2\a\Q M' 1 K) . 

In general A* is not equal to A, so that (det A)" 2 can not be replaced with 1. However it 
can probably be replaced with another function of A (see the proof below) . 

Proof of Lemma 2.6. Since A C Z n , we have x ■ y G Z for any i,!/ £ A so that A C A*. 
Therefore A is a sub-lattice of A*, so that there exists a Z-basis (v i, . . . , Vd) of A* and positive 
integers li, . . . ,£d such that (livi, . . . ,£dVd) is a Z-basis of A. Then i\ . . . id is the index of 
A in A*, that is the absolute value of the determinant of {t\V\, . . . ,£d v d) with respect to the 
basis (vi, . . . ,Vd). Since (iivi, . . . ,£dVd) (respectively (v±, . . . ,Vd)) has determinant equal (up 
to a sign) to det A (respectively det A* = (det A) -1 ) in an orthonormal basis of F, the index 
i\...id is equal to (det A) 2 . Therefore all integers £j are divisors of (det A) 2 , and we have 
(detA) 2 A* C A. 

Let y £ C*(Q,K), so that 

Vx£C(Q,K), \x-y\<l (2.10) 

by applying the definition of a dual convex body to both x and —x, since C(Q, K) is symmetric 
with respect to 0. Since |a|~ 1 Ka = ( | ck| —1 T^cko, • • • , M" 1 Ka n -i) G C(Q, K) because a G F, 
(2.10) yields \y ■ a\ < \a\K~ 1 . On the other hand, if y ^ then (2Q\y\ \a\)- l y G C(Q,K) 
since 2\a\QK > 1, so that (2.10) yields \y\ 2 < y ■ y < 2Q\y\ \a\ and therefore \y\ < 2\a\Q. 
This concludes the proof that C*(Q,K) C C'(2\a\Q, \a\~ l K). 
Now let y G C'(Q,K) and x G C(Q,K). Then we have 



\a x ■ y\ 



(a x - x a) ■ y + x a ■ y < n\a x - x a\ \y\ + \x\ \a ■ y\ < —Q + K— = n + 1, 



so that (n + 1) 1 \a\y G C*(Q,K) since \a\ = |cko | , thereby concluding the proof of Lemma 
2.6. □ 

Proof of Theorem 2.1. First of all, let us notice that for any positive real number fi and any 
Q, K > we have 

Uc(Q,K)=C( f i- 1 Q, f iK), 
\^C'(Q,K)=C'(pQ^- 1 K). 

Using (2.5), these equalities imply 

(\ k (C(n- 1 Q, f iK),A)=n~ 1 \ k (C(Q,K),A), 
\x k (C'(fiQ^- 1 K),A)=fi- 1 X k (C'(Q,K),A) 

for any k G {1, . . . , d} and any \i > 0. 

We shall prove now, at the same time, that ^ k (Q) exists and that Theorem 2.1 holds. Let 
k G {1, . . . , d}, and Q > max(l, (n + 2)C k \a\~ l ). If *& k (Q) exists, for any e > sufficiently 
small (2.8) yields 

i\ k (C(Q,(l-e)* fc (Q)),A) > 1, 
\\ k (C(Q,(l + e)* fc (g)),A) < 1. 
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If ^k(Q) does not exist, then the first inequality is valid when (1 — e)^ k (Q) is replaced with 
any positive real number, and in what follows the first inequality in each pair will still be 
valid in this case. 

The main duality result from geometry of numbers, namely (2.7), enables one to deduce 
that 

\\d+i-k (C*(Q, (1 - e)tffc(Q)), A*) < d\, 
[Xd+i-k (C*(Q, (1 + e)¥fc(Q)), A*) > 1 

Using (2.4) and the fact that 2|a|(l - e)^ k {Q)Q > {n + 2)C' k V k (Q) > 1 for e < 1/2 (because 
ty k takes positive integer values), Lemma 2.6 yields 

f A d+1 _, (C'(2\a\Q, (1 - e)\a\-^ k (Q)), (det A)~ 2 A) < d\, 
\A d+1 _4(n + l)-Vim(l + e)^(Q)),A) > 1. 

Using (2.6) and (2.11) we obtain 

fA d+1 _ fc (C / (2|a|Q,(l-e)|a|- 1 vI/ fc (Q)),A) < (detA) 2 d!, 
[Xd+i-k {C'((n + irVlQ, (1 + e)(n + l)\a\-^ k (Q)), A) > 1. 

Now (2.12) implies 

fA d+1 _ fc (C / (2|a|(detA) 2 d!Q,((detA) 2 d!|a|)- 1 (l-e)^(Q)),A) < 1, 
\\ d+1 _ k (C'(((l + e)(n + l))- 1 ^^, (1 + ef{n + l)\a\-^ k (Q)), A) > 1 + e. 

Since for any Q' > 1, the set of K > such that \d+i-k(C'(Q' , K), A) < 1 is an interval 
between and ^f' d+1 _ k (Q'), we obtain 

f((detA) 2 d!|«|)- 1 (l-e)^ fc (Q)<^ +1 _ fc (2|a|(detA) 2 d!Q), 
1(1 + ef{n + l)\a\-^ k (Q) > K+i-M 1 + + 1)) _1 HQ)- 

The first inequality if false when (1 — e)^ k (Q) is replaced with a sufficiently large real number: 
this concludes the proof that ^ k (Q) exists. Moreover, by letting e tend to zero (so that 
(1 + e)(n + 1) < n + 2), these inequalities conclude also the proof of Theorem 2.1. □ 

Proof of Proposition 2.3. Using (2.12) and the fact that the infimum in the definition of ^ d (Q) 
is attained, we have 

dXdiCid^Q^d^diQ))^) = \ d (C(Q,V d (Q)),A) < 1 

so that there exists a basis of A contained in C(d~ 1 Q,d^ d (Q)) (using the remark after the 
corollary of Theorem VII in Chapter VIII of [Cas59], recalled in §2.3). Let us denote by 
xi,...,x d this basis, and fix j € {1, . . . , d}. Since xj € C(d^ 1 Q, d^ d (Q)), if we write Xj = 
(x jfi , ■ ■ ■ ,x jt n-i), we have 

i\ Xj \<d* d m (213) 

I \xjfid. — ctQXj\ < dQ . 

We claim that Xj^ ^ 0. Indeed, if Xj^ = 0, then the second inequality in (2.13) gives 
|^j,jCio| < dQ^ 1 for all i € {1, . . . , n — 1}. As \a\ = \oto\, this implies that \xj^\ < d{\a\Q)~ l 
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but since Q > d|a| _1 by assumption, this gives \xj^\ < 1. Now Xjj <G Z, so this necessarily 
implies Xji = and together with Xj t o = 0, the vector Xj has to be zero. This contradicts the 
fact that Xj is an element of a basis, and proves the claim. Now we can define ujj = xj^a^xj, 
then ojj is Tj-periodic with Tj = xj^a^ 1 , and replacing Xj by — Xj if necessary, we can assume 
Tj > 0. Since TjOjj = xj, this proves the first part of the statement. Then from the second 
inequality in (2.13), we have 

\TjCt — TjUjj \ = \xjfia^ l a — Xj \ < d(|a|Q) _1 , 

hence 

\a — ojj\ < ddalTjQ)^ 1 
and from the first inequality in (2.13), 

which gives the second part of the statement. This concludes the proof. □ 
2.5 Minimal families 

In this section we prove Proposition 2.2 by interpreting the functions Vl/fc and in the spirit 
of the sequence of minimal points introduced by Davenport and Schmidt (see [DS67], §3 or 
[DS69], §3 and §7), even though a direct proof would be possible. This interpretation seems 
interesting in itself, and it will be useful to compute these functions explicitly in the case 
n = 2 (see §2.6). 

Proof of Proposition 2.2. Let k G {l,...,d}, and denote by the (countable) set of all 
families e = (ei,...,efc) consisting in k linearly independent vectors in Z n n F. Letting 
e P = (e Pj0 , • • • , e v<n -\) for any p € {1, ... , k}, we put 

lei = max max \e v j\. 

I<p<k0<j<n-1 F,J 

We fix an ordering on Ek (given by a bijective map of to N) such that if |e| < |e'j, then 
e comes before e'. To study the function ^fc, we let 

•nig) = max max \e pflaj - e pJ a \. 

l<p<k l<j<n— 1 

Notice that for any e G 4j |e| is a positive integer and 77(e) is a positive real number; there 
exist values of 77(e) arbitrarily close to (because ^k(Q) exists for any Q > 1). 

Recall from Definition 2 (§2.1) that Cj(. is the least value of |e| with e G For any 
real X > Cj(. we consider the set of e € such that |e| < X. Among this finite non-empty 
set, we consider the subset consisting in all e for which 77(e) takes its minimal value. We 
call minimal family corresponding to X the element in this subset which comes first in the 
ordering we have fixed on The choice of this ordering is not important, but choosing one 
in advance enables us to make consistent choices (as X varies): for instance, changing some 
e p into — e p , or making a permutation of e±, . . . ,e&, leaves |e| and 77(e) unchanged. Anyway 
the most important for us will be |e| and 77(e) for minimal families e, and these values do not 
depend on the ordering we choose. 
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Let Tk be the set of all minimal families e (corresponding to some X > C' k ) such that 
77(e) < 1. Then is infinite, countable, and for any distinct e, e' € J-jfe we have |e| 7^ |e'|. 
Let (e^>i denote the sequence of all elements of J^, ordered in such a way that the integer 
sequence (|e^|)^>i increases to +00. Then is the minimal family corresponding to all X in 
the range |e^| < X < |e^ + i|, and the real sequence (?7(e^))^>i decreases to 0. 

For any Q such that ^(e^) -1 < Q < r/i^e+i)^ 1 with £ > 1, we have ^k(Q) = because 
the infimum in the definition of *S>k(Q) is attained by the family e^ +1 . In the same way, we 
have ^k(Q) = \§i\ for any Q such that 1 < Q < ??(e 1 ) _1 . Therefore is constant on each 
interval ]^(e^) _1 , ??(e£ +1 ) _1 ] (and also on [1, ^(e^) -1 ]); it is left-continuous, and its values are 
positive integers. 

A similar interpretation holds for ^' fc , letting 



for any e = (ei, . . . , eu) € For any real X > Cj(. we consider the set of e S 4 such that 
|e| < X. Among this finite set, we focus on the subset consisting in all e for which 7/ (e) takes 
its minimal value. Within this subset, we call minimal family corresponding to X the one 
which comes first in the ordering we have fixed on £ k . Of course the minimal families here are 
not the same as the ones above (which are defined in terms of rj). Since the minimal families 
(corresponding to some X > C' k ) make up an infinite countable set such that |e| 7^ |e'| as 
soon as e 7^ e', they can be ordered in a sequence ((e^D^i with the following properties: the 
integer sequence (|e^|)£>i increases to +00; the real sequence (rf(^))e>i decreases to 0; e' e is 
the minimal family corresponding to all X in the range |e^| < X < \e' e+1 \. 

For any Q such that |e^| < Q < \e' e+1 \ with I > 1, we have W' k (Q) = vi^'e)^ 1 because the 
supremum in the definition of fy' k (Q) is attained by the family e' e . Therefore is constant 
on each interval [\e' e \, \e' e+1 \[; it is right-continuous, and its points of discontinuity are positive 
integers. This concludes the proof of Proposition 2.2. □ 

2.6 The case n = 2 

Let us focus now on the case of just one number, in which the functions and ^>' k can be 
made explicit in terms of continued fractions. Eventhough this section won't be used in the 
rest of the paper, we think it is an instructive example. In precise terms, we take n = 2, 
a = 1 and a x = £ with £ G R \ Q and < £ < 1. We have d = 2 and F = M 2 . 

We denote by Pj/qj the j-th convergent in the continued fraction expansion of £, with 
Po = 0, qo = 1, Pj+i = cij+iPj + Pj-i an d q-j+i = a-j+iqj + qj-i for any j > 1 where a,j is 
the j-th partial quotient (see for instance [Sch80] or [HW54]). We also need to consider the 
semi-convergents defined for any j > 1 and any integer t with < t < aj + \ by 




= max e„ • a 

l<p<k 



qj,t = tqj + qj-i, Pj,t = tpj + Pj-i- 



Notice that 



qjfi = qj-i, Pj,o=Pj-i, 



qj,a j+1 — Pj,a j+1 — Pj+li 



and that for any t £ {0, . . . , Oj+i}: 



\Vj,tt ~ Pj,t\ = \qj-l( ~ Pj-l\ ~ ~Pj\ = (Oj+l + £j - t)\qjt - Pj\ > 



(2.14) 
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since the sign of q n ^ — p n is that of (— l) n ; here £j G]0, 1[ is such that £j\qj£ — Pj\ = |<7j+i£ — 
Pj+i\- In particular this quantity is a decreasing function of t £ {0, . . . , ttj+i}, and we have 
\Qj,t£ ~ Pj,t\ > — Pj\ if i < cij+i- Let us recall also that 

Qj+i < \0jt ~ VjY 1 < Qj+i + Qj = < 2qj+i- 

These relations will be used repeatedly below, without explicit reference. 

To compute the functions Vl/^ and ^' k , we recall that for any Q > 1 there is an integer j, 
and only one, such that qj < Q < qj+i- There is also a unique t such that < t < aj+i and 
Qj,t < Q < Qj,t+i- 

Proposition 2.7. For any j and any Q > 1 we have: 

= Qj, \qj-i€ -Pj-iT 1 < Q < \qjZ-Pj\~ 1 , 

tejA -PjM^ 1 <Q < l<?j,m£ -Pj.t+il -1 ) o <t < aj+i - 2, 
\qj, aj+1 -iZ -Pw+i-il" 1 < Q < \QjZ-Pj\~ 1 , 

*i(Q) = ki? -PiT 1 , qj <Q < Qj+i, 

\Vj-iZ-Pj-i\~ 1 , Qj<Q<Qj,i, 
\Qj,ti -Pj,t\~ X , Qj,t <Q< Qj,t+i, 1 < * < - 1- 
These functions satisfy the following inequalities for any Q > 1: 




^*' 2 (Q/3) < *i(Q) < *' 2 (Q), ^*i(Q/3) < * 2 (Q) < 

The lower bound for is essentially the same as the one provided by Theorem 2.1 in 
this case (see the remark after the statement of this result). Of course the proof is different 
(and we find it instructive) because it relies on the explicit determination of the functions. 
By following this proof it is not difficult to construct examples of numbers £ for which this 
lower bound is essentially an equality for infinitely many Q. 

On the other hand, the upper bound on ^ refines upon the one provided by Theorem 
2.1, and can also be seen to be essentially an equality for infinitely many Q, if £ is properly 
chosen. 

Proof of Proposition 2. 7. We compute Vl/^ and Vl/l by determining the sequences of minimal 
families defined in §2.5. Let us start by computing and that is by taking k = 1. For 
a non-zero vector e = e\ = (ei^e^i) € Z 2 we have |e| = maxde^ol, | ei , i [ ) , 

= |ei, £ - ei,i|, r/(e) = |ei j0 + ei,i£|. 

It is a classical property of continued fractions that ((qe,Pe))e>o is the sequence of minimal 
families defined in terms of r\ and related to ^i, if the ordering on E\ is chosen properly. In 
the same way ((pe, —qe))e>o is the one related to These facts lead immediately to the 
expressions for and ^ in Proposition 2.7 (see the end of §2.5). 

The case k = 2 is slightly more complicated. We let 

£.j,t = ((Qj,Pj),(Qj,t,Pj,t))- 
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In particular e_j consists in two consecutive convergents; so does ej j0 , which is equal to 
§.j+i,o U P to permuting the two vectors. The determinant of the vectors ( q J. ) and ( ) is 
equal to that of ( q J. ) and ( q J~_\ ), so that it is ±1: the vectors in the family e^ t make up a 
basis of 1? . 
We have 

'Ij.i = tQj + Q3-1 



if t > 1 and | e^ 1 = qj , whereas 



^(e j)t ) = |ty-,t£ -Pj,t| = \qj-it -Pj-i\ ~ t\qj£-Pj\ > 

if t < a j+1 - 1 and t](e jta . +l ) = \qj£ - Pj\- 

Let us prove that (up to changing the ordering we have fixed on 82) the sequence (e^>i 
of minimal families constructed in §2.5 is exactly the sequence of families ej t with j > 1 and 
< t < a,j+i — 1, indexed in such a way that \ej t \ is increasing. With this aim in view, let 
X > 1 and denote by e = (ei,e2) the minimal family corresponding to X. Let j > 1 and 
t € {0, . . . , Oj + i — 1} be such that |e^ t | < X < |e J;t+1 1 (that is, max(gj, gj jt ) < X < <7j,t+i)- Let 
e = (q,p) denote either e\ or e2- We have |g| < X < qj + i and |g£— p\ < v(.§.j,t) ^ l<Zj-i£ — Pj-i 
so that: 



det 








det 




p Pj _ 









< \q\ 



q£,-p qji-Pj 
kj€ - I + - P| < ~Pj\+ QjlQj-it ~ Pj-i I < 2 



so that this determinant is equal to —1, 0, or 1. Since e^ t is a basis of Z 2 , this yields 
e = (<?,p) = a{qj,Pj) + /3(qj,t,Pj,t) with a € Z and /3 G {-1,0,1}. Changing (q,p) into 
(— g, —p) if necessary (which follows from a suitable change in the ordering on 82), we may 
assume that e = ce(qj,Pj) with a > 1 if /3 = 0, and /3 = 1 otherwise. In the latter case we have 
<? = 9j-i + (i+a)?j with a < (since \q\ < X < qj tt +i), t+a > (since \q(,— p\ < Pj-i\, 
using the fact that qj-iS, — pj-i and qj^ — pj have opposite signs), and finally a = because 
— p\ < v(£j,t) = \lj,tC ~ Pj,t\- Therefore we have proved that (up to permuting the two 
vectors e\ and e2 which make up the family e) we have e± = a(qj,pj) with a > 1 and 
e 2 = (qj,t,Pj,t)- Since r/(e) < i](&j,t) we deduce that r/(e) = rj(ej t ). Now e comes before ej t 
in the ordering on 82 (by definition of a minimal family), so that |e| < \ej t \ (thanks to the 
assumption we make, throughout the paper, on the orderings we choose on 8k)- If t = this 



1 and e 



e_j t . If i > 1 then a might be different from 1, but since |e| 



-j,t\ 



implies a 

and 77(e) = v((Lj,t) we ma y change the ordering on 82 in such a way that e^ t comes first, 
so that e = e_j t in this case too. This concludes the proof that ej t is the minimal family 
corresponding to X. 

The values of ^(Q) follow immediately (see the end of §2.5). The proof is similar for 
^' 2 (Q): the sequence of minimal families defined in terms of 7/ and corresponding to is 
given by the families ((pj, —qj), (Pj,t, —Qj,t)) if the ordering on 82 is appropriate. 

To conclude the proof of Proposition 2.7, let us compare Vl/^ and ^3_j., starting with the 
inequality ^i(Q) < ^' 2 {Q)- Let j be such that |<7j-i£ — Pj~\\~ x < Q < \qj£ — Pj\ _1 ; then 
^i(Q) = qj- Since *' 2 i s non-decreasing and \qj-i£ — Pj-il^ 1 < qj,i, we may assume that 
|<£,_i£ — Pj-il^ 1 < Q < qj,i so that ^^{Q) = |<7j-i£ ~ Pj-i\~ X an d the conclusion follows. 
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Let us prove in the same way that ^(Q) < ^i(Q)- Let j be such that qj < Q < qj+i, so 
that *&i(Q) = \qjC — Pj\ l - Since Q < qj + i < \qj£ — Pjl^ 1 and ^2 is non-decreasing, we have 

* 2 (<3)<*2(i^-^r 1 ) = ^ + i<*i(<3)- 

The remaining two inequalities are slightly less straightforward. We shall begin with the 
following facts: 

\Qj,a j+1 /3^ ~ Pj^+^r 1 < %Qj (2-15) 

and 

9j,(o j+ i+i)/3 > ^\Q^-Pj\~\ (2-16) 

where qjj and pjj are defined by tqj + qj-\ and tpj + Pj-i even if t is not an integer. 
To prove (2.15), we write using (2.14): 

3qj\qj,a j+1 /3t-Pj,a j+1 /3\ = <lj'^'j \ ■ ^j) Qji Pj > 2<lj(aj+i+£j)\<ljt-Pj\ = ^Qj\Qj-i^~Pj-i\ > 1, 
whereas (2.16) follows from 

fat - PjT 1 < + Qj = Qj-i + ( a i+i + l )Qj < 3 ^j-i + ° J+1 3 + - g^ • 

Let us deduce now that ^' 2 (Q/3) < ^i(Q) for any Q. Let j be such that \qj-i£, — Pj-il^ 1 < 
Q < \qj£, — Pj\~ l 1 so that \&i(Q) = g^. Since ^ is non-decreasing we may assume Q = 
\qj£ — Pj\~ l ■ Let t denote the integer part of ctj+i/3. Then (2.16) yields Q/3 < <Zj,t+i so that, 
using (2.15) and the fact that \P 2 is non-decreasing: 

*' 2 (Q/3) < 1^-Pi.tl" 1 < % = 3*i (Q) 

if i > 1, and 

* 2 (<3/3) < |<fc-i£ < 2 9j = 2*i(Q) 

if t = 0. This concludes the proof that ±*' 2 (Q/3) < *i(Q). 

Let us prove now, along the same lines, that < ^(Q)- Let j be such that 

Qj < Q/3 < then ^(Q/3) = \qj£ — Pj\~ x - Since ^2 is non-decreasing we may assume 

that Q = 3qj. Letting t denote the integer part of a J+ i/3, (2.15) yields \qj,t€ ~ PjA~ l < Q- 
If t < ctj+i - 2 we obtain * 2 (Q) > Qj,t+i > |*i(<5/3) using (2.16); if t = a j+1 - 1 then 
*2(Q) > > |*i(Q/3)- In both cases this concludes the proof of Proposition 2.7. □ 



3 Applications to the KAM and Nekhoroshev theorems 

Based on the results of the previous section, we describe now a new approach to the pertur- 
bation theory for quasi-periodic solutions. For simplicity, we will restrict to perturbations of 
constant vector fields on the n-dimensional torus, but clearly the approach can be extended 
to other situations. We will give stability results both in "finite" and "infinite" time, which 
correspond respectively to the construction of a "partial" normal form, for any 1 < d < n, 
and to an "inverted" normal form for d = n. The first result corresponds to a Nekhoroshev 
type theorem, while the second result is the KAM theorem for vector fields. 

We state our main stability results, Theorem 3.1 and Theorem 3.3, in §3.1. Then in §3.2 
we prove a periodic averaging result (Lemma 3.4), which will be the only analytical tool in our 
proofs. A quasi-periodic averaging result (Lemma 3.5) will then be easily obtained from the 
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periodic averaging result through approximation by periodic vectors (Proposition 2.3 in §2.2), 
and this constitutes the main novelty in our proofs. Finally, Theorem 3.1 and Theorem 3.3 will 
be proved respectively in §3.4 and §3.5 by applying inductively the quasi-periodic averaging 
lemma: the proof of Theorem 3.1 consists of a finite iteration and is direct while the proof of 
Theorem 3.1 relies on an infinite induction and is more implicit. 

3.1 Main results 

Let us first describe the setting. Let n > 2, T n = R n /Z n and = C n /Z n . For z = 
(zi,...,z n ) € C n , we define \z\ = maxi<j< n \zi\, and given s > 0, we define a complex 
neighbourhood of T n in by 

1? = {0 6 Tg | |lm(0)l < «}• 
Given a € M. n \ {0}, we will consider bounded real-analytic vector fields on , of the form 
X = X a + P, X a = a, \P\ S = sup \P(z)\ < e. (*) 

By real-analytic, we mean that the vector field is analytic and is real valued for real arguments. 
For any such vector field Y, we shall denote by Y t its time-i map for values of t € C which 
makes sense, and given another vector field Z, we denote by [Y, Z] their Lie bracket. Moreover, 
for a real-analytic embedding 3> : T™ — s- Tg, r < s, and for a real-analytic vector field Y which 
is well-defined on the image of <3?, we let &*Y be the pull-back Y, which is well-defined on T". 

Finally, if d is the number of effective frequencies of a, we denote by ^Sf a = ^ the function 
defined in §2.1, Definition 1. From Proposition 2.2, ^ a is left-continuous and non-decreasing, 
hence the function 

A a : [1, +oo[^ [1, +oo[, A Q (Q) = Q* a (Q) (3.1) 
is left-continuous and increasing. Therefore it has a generalized inverse 

A; : [l,+oo[->- [l,+oo[, A* (x) = sup{Q > 1 | A a (Q) < x} (3.2) 

which satisfies A*(A a (Q)) = Q and A a (A*(x)) < x. Moreover, A* is both non-decreasing 
and continuous. 

Our first result is the following. 
Theorem 3.1. Let X = X a + P be as in (*), define 

Q(e) = A* a ((2s)- 1 ) , Kd , a = 2 & d\2 d - l)\a\-\ 

and assume that 

Q{e) > max(l,d|ar 1 ), Q(e) > k^s" 1 . (3.3) 
Then there exists a real- analytic embedding <I> : T™^ 2 — > T™ such that 

$*X = X a + N + R, [X a ,N} = 0, 

with the estimates |$ — Id| s / 2 < d 2 |a| _1 Q(£) _1 and 

\N\ s/2 < 2e, \R\ s/2 < 2eexp (-(In 2)n^sQ{ej) . 
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The theorem states that the perturbed vector field X = X a + P can be analytically 
conjugate to a "partial" normal form X a + N + R, that is a normal form X a + N where 
N commutes with X a , plus a "small" remainder R. In general, this remainder cannot be 
equal to zero, regardless of the Diophantine properties of a. This theorem will be proved by 
applying finitely many steps of averaging, as usual. The novelty here is that each of this step, 
which in all other proofs require to estimate small divisors, will be replaced by d elementary 
steps in which we will only deal with periodic approximations. So it could be said that our 
proof will reduce the general case 1 < d < n to the special case d = 1. In the case d = 1, 
q is a periodic vector and its only periodic approximation is itself: the function ^ a (Q) is 
therefore constantly equals to its period T, and then Q(e) = \a\(2Te)^ 1 . 

Note that Theorem 3.1 implies a result of "stability" in finite time: one can compare the 
flow of X a + P to the simpler flow of X a + N, during an interval of time \t\ < T(e) where 
T(e) is essentially the inverse of the size of the remainder R. Then, one should probably be 
able to show that this time T(e) cannot be improved "uniformly" in general (see [Boul2]: 
the example there is given for d = n and for Hamiltonian vector fields, but the construction 
should extend to our setting). 

The smallness of the remainder is of course entirely tied up with the Diophantine properties 
of a. If A* grows very slowly, that is if ^f a grows very fast, the size of the remainder R, 
which is always strictly smaller than e, may not be much smaller. However, if a € X>J, that 
is if a is a Diophantine vector with exponent r > d — 1, then by Corollary 2.4, one can find 
two positive constants Q a > 1 and C a > such that for all Q > Q a , ^ a (Q) < C a Q T . The 
above corollary is therefore an immediate consequence of Theorem 3.1. 

Corollary 3.2. Let X = X a + P be as in (*), with a G V T d , r > d — 1, define 
E a = max(l, dH-\Q a ), K dyCt = 2 6 d 2 (2 d - l)\a\-\ 

and assume that 

e < (2C a )-\E a r^ l \ e < (2C Q )~ 1 (^^ 1 )" (r+1) - (3-4) 
Then there exists a real- analytic embedding <I> : T™^ 2 — >■ T" such that 

<b*X = X a + N + R, [X a , N] = 
with the estimates |<J> — Id| s / 2 < " H (2C Q £) X + T and 

\N\ s/2 < 2e, \R\ s/2 < 2eexp (-(ln2)K^s(2C a e) - ^) . 

For our second result, we have to impose two restrictions on our vector a. First we assume 
that it satisfies the following integral condition: 

La,^> <0 °- <A) 

where A* is the function defined in (3.2). Using Corollary 2.5, we will show in appendix A, 
Lemma A.l, that a satisfies this condition (A) if and only if a G Bj, that is condition (A) 
is equivalent to the Bruno-Riissmann condition. Furthermore, we have to restrict to the case 
d = n: under such an assumption, any real-analytic vector field ./V such that [N, X a ] = is 
constant, that is N = Xp for some (3 £ C n . 
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Theorem 3.3. Let X = X a + P be as in (*), with a satisfying condition (A) and d = n, 
define 




dx 



2 6 d 2 (2 d -l)\a 



-i 



A a (Q(s)) zA*(z)' 



K d,a — 



and assume that 



Q(e) > max(l,d|a| x ), r(e) < n d ^s. 



(3.5) 



Then there exist a unique constant (3 € C™ and a real- analytic embedding <E> : T™^ 2 — > T™ swc/j 
that 



The above theorem states that by adding a "modifying term" Xp (in the terminology of 
Moser, [Mos67]) to the perturbed vector field X = X a + P, the modified perturbed vector 
field X + Xj3 = X a+/ 3 + P can be analytically conjugated to X a . In view of Theorem 3.1 
in the case d = n, the result says that if we modify the original perturbed vector field by 
adding a vector field N such that [N, X a ], then the modified perturbed vector field can be 
analytically conjugate to X a . Hence such a statement can be called an "inverted" normal 
form, and it implies a result of "stability" in infinite times (but not for the original vector 
field). Theorem 3.3 is exactly the classical KAM theorem for constant vector fields on the 
torus first proved by Arnold ([Arn61]) and Moser ([Mos66]). The proof we will give here is 
very close to the proof of Theorem 3.1, replacing a finite iteration by an infinite iteration, the 
latter being possible by the restrictions imposed on a. As for the proof of Theorem 3.1, only 
the proof of the averaging step is new, the induction is then essentially classical. Also, as in 
[RuslO] or [Posll], the speed of convergence in the induction process is linear, and we can 
artificially prescribed any rate of convergence < c < 1 without any difficulties (but for the 
statement we simply chose c = 1/2). 

Now let us briefly discuss the restrictions we imposed on a. First an arithmetic condition 
is known to be necessary in the case n = 2, and the condition we used is known to be optimal 
by a result of Yoccoz (see [Yoc02] for instance, where the discrete version of the problem, 
which is the analytic linearization of circle diffeomorphisms, is considered). However, for 
n > 3, nothing is known, and the flexibility in the proof might suggest that condition (A) in 
Theorem 3.3 is perhaps not "optimal", as trying to get rid of any artificial choice in the proof 
might lead to a perhaps weaker arithmetical condition (and the speed of convergence in the 
iteration might be sub-linear). Finally, we had to restrict also to d = n, and one may ask if 
the statement remains true for any 1 < d < n, in the sense that there exists a real-analytic 
vector field ./V which commutes with X a such that X + N can be analytically conjugated to 



3.2 Periodic averaging 

In this paragraph, we will prove a result about periodic averaging. We consider a periodic 
frequency oo € M n \ {0} with minimal period T, and let X^ = oj. Recall that averaging along 
such a periodic frequency, no "small divisors" arise, one can solve the associated homological 
equation by a simple integral formula without expanding in Fourier series and cutting Fourier 



<S>*{X + X p )=X a 



with the estimates 



$-Id| s/2 < S-^dVrMe), 101 <2e. 



X a . 
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modes. In terms of small divisors, one can notice that for all fc£Z n such that k-u / 0, then 
\k ■ uj\ > T _1 , but of course we will not use this point of view. 

Given w G W 1 \ {0}, we let X m = zu and given two parameters fx > and 5 > 0, we 
consider a real-analytic vector field on T" of the form 

Y = X U + X m + S + P, \w\<n, \S\ S <8, \P\ s <e. (3.6) 

Here X u is considered as unperturbed, P is our original perturbation and the vector fields X m 
and S are "parameters" which are free for the moment, but will be determined subsequently. 
The vector field X m represents a shift of frequency, and will be chosen in §3.3: more precisely, 
we will approximate our original vector a by a periodic vector u, and then choose w = a — lo 
so that X a = X w + X w . The vector field S can be ignored at this stage, it will become 
important only for the inductions later on in §3.4 and §3.5. 

Lemma 3.4. Consider Y as in (3.6), and for < <j < s, < b < 1, assume that 

e + 5 < fi, 2e < fi, 2 4 T / u<T 1 < b. (3.7) 

Then, setting 

[P}= f Po X l Tu] dt, V = T f (P-[P])o X l T Jdt 
Jo Jo 

the map V 1 : T"_ ? — >■ T™ is well-defined real-analytic embedding, and 

(V l )*Y = X L0 + X VU + S+[P] + P 

with the estimates 

IV 1 - Id| s _ ? < Te, \[P]\ s <e, \P\ s -,<be. 

Note that our condition (3.7) implies in particular that Te is sufficiently small with respect 
to <r, and at this stage this is essentially the only assumption we need (let us also note that 
the second part of the condition (3.7) is here just for convenience, as it will be implied by the 
first part). We artificially introduced other assumptions in order to prescribe already the size 
of the remainder P to be, for an arbitrary < b < 1. Note also that our vector field V, and 
therefore the transformation V 1 , is independent of the choice of X m and S. 

In the special case where X m = S = 0, this is just an averaging along the periodic flow 
generated by X w . From an analytical point of view, X^ induces a semi-simple linear differen- 
tial operator Cx u = [■ , Xj\ acting on the space of vector fields: [P] is just the projection of 
P onto the kernel of Cx u , hence [[P],X W ] = 0, then P — [P] lies in the range of £x„ and V is 
its unique pre-image, that is [V, Xj\ = P — [P]. From a geometrical point of view, let J- 1 be 
the one-dimensional foliation tangent to X w : since to is periodic, the leaves J- 1 are compact 
and diffeomorphic to T 1 . Then [P] is just the mean value of P on each leaf of J 71 , the vector 
field P — [P] has therefore zero mean so we can integrate it along each leaf of F 1 to obtain 
a periodic vector field V . The statement that there is no "small divisors" can be expressed 
by saying that the inverse operator of Cx^ (defined on the image of Cx w ) is bounded by T, 
which simply comes from the fact that the leaves of T 1 are compact. The proof of Lemma 3.4 
is essentially classical, but for completeness we give all the details, using technical estimates 
which are contained in Appendix B. 
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Proof of Lemma 3.4- Since \P\ S < e, then obviously \[P]\ S < £ and \V\ S < Te. From the 
second and third part of (3.7), we have in particular Te < hence by Lemma B.l, the map 
V 1 : T™_ ? — > T" is a well-defined real-analytic embedding and 

l^ 1 -Id| s _ f < |F| S <Te. 

Now we can write 

(y!)*y = (v 1 )*^ + (V 1 )*^ + 5 + P) (3.8) 

and using the general equality 

j t (V t )*F = (V t y[F,V] 

for an arbitrary vector field F, we can apply Taylor's formula with integral remainder to the 
right-hand side of (3.8), at order two for the first term and at order one for the second term, 
and we get 

(v r )*Y = x w +[x w , v]+ [\i-t)(v t )*[[x LJ ,v},v}dt+x vu +s+p+ f (v'nx^+s+p^dt. 

Jo Jo 

Now let us check that the equality [V, X w \ = P — [P] holds true: let us denote G = P — [P] 
and DV the differential of V, then since X w is a constant vector field, we have 

[V,X U ] =DV.uj = T f D{GoX t Tw ).utdt= ! D(G o X l Tu] ).Tujtdt 
Jo Jo 

so using the chain rule 

and an integration by parts 

[V, X u ] = (G o Jt|J ~ f Go X^dt = G, 

Jo 

where in the last equality, GoX^ = G since Tuj € Z n and the integral vanishes since [G] = 0. 
So using the equality [V, Xj[ = P — [P], that can be written as [X u , V] + P = [P], we have 

(V l yY = X w + X w + S + [P] + f\l - i)(V*)*[[X w , V], V]dt + f\v*)*[X w + S + P, V]dt, 

Jo Jo 

and if we set 

p t = tp + (i-t)[P], P = [ (y t )*[p t + x w + s,v]dt 

Jo 

and use again the equality [X u , V] = [P] — P we eventually obtain 

(V l )*Y = X U] + X m + S+[P] + P. 

It remains to estimate P, and for that let us write U = \X m + S + P, V] . Using the second 
and third part of (3.7), we have Te < 2 _1 T/i < < (8e) -1 ? = (4e)~V 2 and therefore by 
Lemma B.3, we can estimate 

\P\s-, < W\s-<;/2- (3-9) 
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Now the term U is just a sum of three Lie brackets, and as |P*| S < e, each of them can be 
estimated by Lemma B.2 and we obtain 

\U\ a -<:/2 < 4Te? -1 (e + \i + 6) < 8T<j _ Ve (3-10) 

where the last inequality uses the first part of (3.7). Now from (3.9), (3.10) and the last part 
of (3.7), we finally obtain 

|P| a _ 5 < 16T? _1 /ze < be 
which is the estimate we wanted. □ 



3.3 Quasi-periodic averaging 

Now we consider our original unperturbed vector field X a and a real-analytic vector field on 
T" of the form 

Z = X a + S + P, \S\ 8 <6, \P\ 8 <e, (3.11) 

which, when 5 = 0, corresponds to our vector field X as in (*). Recall that to a € R n \ {0} is 
associated a d-dimensional vector subspace F a C R n , which is the smallest rational subspace 
containing a, and a Z-module A a = Z n n F a of rank d. 

For a parameter Q > max(l, d|a| _1 ) to be chosen later, we will use Proposition 2.3 to find 
d periodic vectors ojj <G W l \ {0}, with periods Tj essentially bounded by ty a (Q), such that 
each ujj is essentially a Q-approximation of a and the integer vectors TjUj form a Z-basis of 
A a . Then applying inductively d times Lemma 3.4, we will obtain the following result. 

Lemma 3.5. Consider Z as in (3.11), and for Q > max(l, c^al -1 ), < a < s, and < c < 1, 
assume that 

QV a (Q)(e + 5)<l, 2Qtf a (Q)e<l, 2 4 d 2 (2 d - l)\a\- x Q' 1 a' 1 < c. (3.12) 
Then there exists a real analytic embedding $ : T"^ — » T" such that 

$*Z = X a + S + P + P + , P = [ PoX]<ii? 

with the estimates 

|$-Id| s _ CT < d 2 \a\- 1 ^! a {Q)e, \P\ S < e, \P + \ s -a < ce. 

Let be the d-dimensional linear foliation of W 1 given by W 1 = \_\ aeF ± F®, where 

F® = a + F a is the translate of F a by a G F^. If it : W l — >■ T n is the canonical projection, then 
n(F a ) = F a /A a is compact and diffeomorphic to T d , hence F d = Tt{F d ) is a d-dimensional 
foliation of T n with compact leaves diffeomorphic to T d . So in the above statement, dd is 
the Haar measure on the compact quotient group F a /A a and is the time-one map of the 
constant vector field X$ = Note that by Birkhoff's ergodic theorem, one can also express 
P in the more classical format 

P= lim - f PoX^dt 

s^+oo s Jo 

since each leaf of F d is invariant by X a , and the restriction of X a to each such leaf is uniquely 
ergodic (hence the above convergence is uniform). The vector field P can be equivalently 
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considered as being the projection of P onto the kernel of the operator Cx a = [ • , X a ] or the 
mean value of P along the leaves of As before, we will notice that the transformation 
constructed in the above lemma is independent of the choice of 5. 

In the special case where 5 = 0, this is just an averaging along the quasi-periodic flow 
generated by X a . In the classical approach, one tries to solve the equation [W, X a \ = P — P, 
or equivalently to integrate P — P along the leaves of the one-dimensional foliation tangent to 
X a , to find a coordinate transformation $ = W 1 . But then small divisors inevitably arise as 
the inverse operator of Cx a is now unbounded, since the leaves of the foliation defined by X a 
are non-compact. To overcome this difficulty, usually one introduce a parameter K > 1 and 
replace P by a polynomial approximation Pk obtained by cutting higher Fourier modes of P. 
The term P — Pk is then considered as an "error" and thus one solves only an "approximate" 
equation. 

Here we shall use a significantly different though "dual" approach, replacing the quasi- 
periodic flow X a by d independent approximating periodic flows X m , . . . , X u!d , which at each 
point will be tangent to the foliation F d More precisely, for a given Q sufficiently large, we 
will approximate a by d independent T^-periodic vectors ojj such that the integer vectors TjUj 
form a Z-basis of A a . Replacing a by ojj, the terms a — ujj will be considered as "errors" 
and we will solve d equations [Vj-,X W ] = Pj-i — [Pj~i}j successively, starting with Pq = P 
and choosing at each step Pj = [Pj-i]j, where denotes the average along the periodic 
flow of Uj. At each step, only the inverse operator of Cx^. = [• ,X Wj ] will be involved, and 
the latter is bounded by Tj. At the end, we will find a coordinate transformation $ as the 
composition of time-one maps o ■ ■ ■ o V d , even though we do not integrate along the flow 
of X a (that is we do not find a generating function of <&). Also, P d will be equal to the 
(space) average of P along the leaves of the foliation spanned by X Ul , . . . , X Ldd (which is the 
foliation T^) and therefore Pd is also the (time) average along the flow of X a . Note that the 
role of our "approximating" parameter Q > 1 is dual to the use of the parameter K > 1 in 
the classical approach, and by solving exactly the d equations we mentioned above we will 
essentially solves the classical approximate equation. 

Proof of Lemma 3.5. For a given Q > max(l, rf|ct| 1 ) , we apply Proposition 2.3: there exists 
d periodic vectors u\, . . . , uj^ of periods T\, . . . , Tj, such that T±ui, . . . , T^Wd form a Z-basis 
of A a and for j G {1, . . . , d}, 

\a - uj\ < d(|a|TjQ) -1 , H" 1 < Tj < \a\~ l d^ d {Q). 

Now we define Wj = a — ojj and fij = c/(|a|TjQ) _1 so that \wj\ < fij, and we set <; = d~ l a 
and b = (2 d - l) _1 c. Then < a < s and < c < 1 implies < ? < s and < b < 1, and for 
any j € {1, . . . , d}, by (3.12), the conditions 

e + 5 < uj, 2e < n j: 2 4 T j ^- 1 < b, (3.13) 

are satisfied. Set Pq = P and define inductively 

Pj = J Pj-i o x^.dt = [P^, j e {l, . . . , d}. 

Obviously, \Pj\ s < e and 

P d = j\..£poX t ^ i o...oX t ^ d dt 1 ...dt d = P 
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since 

{tiTiwi + • • • + t d T d u d | < tj < 1, j € {1, . . . , d}} C P Q 

is a fundamental domain of F a /A a . Now for j G {0, . . . , d}, set Sj = s — j?, then so = s and 
= s — a. Then we claim that for any j G {0, . . . , <i}, there exist a real analytic embedding 
&j : T™ -»• T£ such that 

= X Q + 5 + P, + P+ 

with the estimates 

\<S>j - ld\ Sj < dj|a|- 1 ^ a (Q)e, \P+\ 8 . < (2* - l)be. 

For j = 0, since Pq = P, letting $o be the identity and P + = 0, there is nothing to prove. So 
assume the statement is true for some j £ {0, . . . , d — 1}, and let us prove it remains true for 
j + 1. To do so, let us write, X a = X Uj + X mj so that by (3.13), Lemma 3.4 can be applied 
to the vector field 

Yj = ($*)Z -P+ = Xa + S + Pj = X Uj + X m] +S + P 3 

and if 

Vj+i = T j+1 J {Pj - P j+1 ) o . +lW . +1 tdt, 
then V^ +1 : T" — >• T™ is a well-defined analytic embedding for which 

(V^)*!^' = X Wj + + S + Pj+i + Pj + i = X a + 5 + Pj+i + Pj+i, 
with the estimate 

- Id| s , +1 < T j+1 e < d\a\~ 1 ^ a (Q)e, 
So we define = o and P+_ 1 = P j+1 + (^/ +1 )*P+ hence 

(<&* +1 )^ = X Q + S + P J+1 + P+ +1 . 
For the estimates, we write — Id = <frj o V^_i — V^ + i + ~~ ^ so ^at 

- Id| s , +1 < j*,- - Id| s . + - Id| s , +1 < d(j + l)\a\- 1 ^f a (Q)e 
and using Lemma B.3, we have 

\{Vl +i yP+\ Sj+1 <2\P+\ Sj <2{2i -\)be 

hence 

l^ik+i < be + 2{2? - \)be = (1 + 2(2* - l))be = (2^' +1 - l)be. 

So this proves the claim, and setting <3? = <3?d and P + = P^, as P d = P and b = (2 d — l)~ 1 c, 
this also proves the lemma. □ 
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3.4 Proof of Theorem 3.1 

We are finally ready to give the proof of Theorem 3.1, which consists in applying inductively 
Lemma 3.5 as many times as we can. For simplicity, this lemma will be applied with c = 1/2. 

Proof of Theorem 3.1. Let C d , a = 2 5 d 2 (2 d - l)|a| _1 and n d>a = 2C d , a . For m > 1 to be 
chosen below, and for i <G {0, . . . , m}, define 

i-1 

Ei = 2-% 7* = (1 - 2~')2e = ^ °i = (2m)- 1 s. 

k=0 

Note that £i + ji = 7 i+i < 2e for i € {0, . . . , m}. Let s_i = sq = s and define inductively 
Si = Si-i — o~i for i £ {1, • • • , m} so that s m = s/2. For Q > max(l, d|a| -1 ) to be chosen 
below, if the conditions 

QV a (Q)(£i + 7i) < 1, 2Q^ a (Q) £l < 1, Q^Q-V- 1 < 1 (3.14) 

are satisfied, then we claim that for all i € {0, ... , m}, there exists a real-analytic embedding 
¥ : T™ -> T™ such that 

(&)*X = X a + N l + P u [Ni, X a ] = 0, 

with the estimates 

\& - H| Si < d 2 |a| _1 * a (Q)7i, |iV i | Si _ 1 < 7i , \Pi\ Si < £i . 

Indeed, for i = 0, choosing <I> to be the identity, iVo = and Pq = P, there is nothing to 
prove. So assume the statement holds true for some i G {0, ... ,m — 1}, and let us prove it 
remains true for i + 1. By the induction hypothesis and (3.14), we can apply Lemma 3.5 
with c = 1/2 to Z = [¥)*X = X a + Ni + P u so with S = Ni and 6 = 7i , and we find a 
real-analytic embedding : — > T™ such that 

with the estimates 

- Id| ii+1 < d 2 |ar 1 * Q (Q)£ i , |P| Si < e u \P t + \s l+1 < 2- l Ei = e i+1 . 

Since £; + 7i = 7i+ i and [P7,X Q ] = 0, setting = o jV i+ i = iV * + P and 

Pj + i = Pj + , the statement obviously holds true for i + 1 and this proves the claim. Therefore 
setting $ = $ m : T™ /2 -»• T™, iV = iV m and R = P m , we obtain 

$*X = X Q + iV + P, [X Q ,AT] = 0, 

with the estimates 

|$ - Id| s/2 < d 2 \a\^ a {Q) lm < 2d 2 \a\- 1 * a (Q)e, 

and 

|iV| s/2 < |iVU__ 1 < 7m < 2e, \R\ s/2 <E m = 2~ m E. 
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Now it remains to choose our parameters Q > 1 and m > 1 in order to fulfil our as- 
sumption (3.14). First, Si + 7$ < 2e and 2ej < 2e for all i € {0, ...,m}, hence the 
first and second part of (3.14) are satisfied if 2Q^ a (Q)e < 1, and the latter is satisfied if 
A a (Q) = Q^a{Q) < (2e) _1 , that is if Q = A* ((2s)- 1 ). As for the third part of (3.14), 
since a~ l = 2ms" 1 the latter is satisfied if we choose m to be the largest integer smaller than 
(2Cd t a)~ 1 sQ = Kj^sQ. So eventually Q and m are chosen as follows: 

Q = A* a {(2s)- 1 ) , m = [(K d , a )- l sQ\ = [(k^)" 1 ^* (pe)" 1 )] . 

Our threshold (3.3) in the statement of Theorem 3.1 ensures that both Q > max (MM -1 ) 
and m > 1, and we have obtained 

$*A = X a + A + i2, [X a ,N] = 0, 

with the estimates 

1$ - Id| s/2 < 2d 2 \a\- 1 ^ a {Q)e < d 2 \a\- 1 Q- 1 
and since m > n^sQ — 1, 

\N\ s/2 < 2e, \R\ s/2 < 2e2~ K *> Q = 2eexp (- In 2{K d , a )- 1 sQ) . 
This was the statement to prove. □ 



3.5 Proof of Theorem 3.3 

First let us point out that even if we impose an arithmetical condition on a and we restrict 
to d = n, it seems very difficult to use the scheme of the proof of Theorem 3.1 to go from a 
finite iteration to an infinite iteration, that is to analytically conjugate X a + P to a normal 
form X a + TV, with [X a ,N] = 0. Of course, there is no problem at the formal level as one 
can construct a formal transformation and a formal vector field iVoo such that formally 
(&°°)*X = X a + iV^. However, because of the presence of the formal vector field iV^, 
the formal transformation <£°° cannot converge in general. A technical explanation in our 
situation goes as follows. To make infinitely many iterations in the previous scheme, instead 
of fixing a large Q with respect to a small e, one should consider an increasing sequence 
of Q m tending to infinity with respect to the decreasing sequence e m tending to zero. But 
then at each step we would have to apply Lemma 3.5 with 6 = j m and the first part of 
condition (3.14) would read Q m ^ a{Qm){^m + 7m) < 1- This condition cannot hold for all 
m £ N: Qm^a(Qm) has to tend to infinity, while e m + 7 m does not converge to zero (it is 
strictly bigger than e, and actually converges to 2e). 

Now we will explain how a slight modification of this scheme (and therefore of the expected 
result) will lead us to the proof of Theorem 3.3. First, for d = n, a vector field which 
commutes with X a has to be constant, that is N = Xp for some (3 G C n . Then the idea, 
which goes back to Arnold and which has been largely exploited by Moser (see [Mos67]), is 
that instead of trying to conjugate X = X a + P to a normal form X a + Xp, we will try to 
conjugate a modified vector field X + Xp = X a + Xp + P to the constant vector field X a . This 
simple change of point of view will allow us to apply the averaging procedure described in 
Lemma 3.5 infinitely many times and to prove the convergence easily. Technically speaking, 
at each step we will be able to apply Lemma 3.5 with 5 = 2e m (and not with 5 = j m as 
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we did before), and the condition Qm^ a{Qm){^m + 2e m ) < 1 can and will be fulfilled for all 
m 6 N. The last part of condition (3.14) will be used as 3> definition of <r m; namely a m will 
be essentially equal to Q^, and the arithmetic condition on a will ensure that the series of 
Q^, and therefore the series of a m , is finite and can be made as small as we wishes provided 
we choose Q = Qo sufficiently large. 

The only real difference with the scheme of the proof of Theorem 3.1, where everything 
was explicit, is that here the modifying vector field Xp cannot be constructed explicitly, its 
existence is obtained by an implicit argument (exactly like a fixed point whose existence is 
obtained by Picard or Newton iterations). The modifying vector field Xp cannot be deter- 
mined in advance, at each step of the iteration it is only known at a certain precision which 
increases with the number of steps, so that it is only at the end of iteration that (3 can be 
uniquely determined. To make things more precise, for a given r > 0, let 

B r (a) = {x e C n \\a-x\< r}. 

Then Lemma 3.5 can be recast as follows. 

Lemma 3.6. Let X be as in (*) with d = n, Q > max(l, d|a| _1 ), < a < s and assume that 

3Q^ a (Q)e < 1, 2 5 d 2 (2 d - l)\u\- 1 Q~ l a' 1 < 1. (3.15) 

Then there exist an embedding if : B £ (a) — > B2 £ {a) and a real analytic embedding $ : T™_ CT — > 
Tg such that for all x £ B £ (o), 

<S>*{X^ X) + P) = X X + P + 

with the estimates 

|$ - Id| s _ CT < d 2 \a\-H a {Q)e, \P + \ s -c < 

Proof. Since d = n, we have P = X v for some vector r? € C n , and as \P\ S < e, then |r/| < e. 
So let us define (p : B £ (a) — > B2 £ (a) to be the translation 

tp{x) = x — r], x G B £ (a). 

Take any x 6 B £ (a), then \<p(x) — a\ < \x — a\ + \rj\ < 2e and we can write 

X(p( x ) + P = X a + X ip ^_ a + P 

and by condition (3.15), we can apply Lemma 3.5 with S = X^ x )_ a , 5 = 2e and c = 1/2, to 
find a real analytic embedding : T™_ CT — >■ T" such that 

**(x v{x) + P) = x v{x)+v + P+ = X X + P + 

with the estimates 

|$ - Id| s _ CT < #\a\- x V a {Q)e, \P + \ s -a < 2" 1 £. 
This was the statement to prove. □ 
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Let us point out that the transformation <3? in the above lemma is indeed independent of 
the choice of y G B2 £ (a), simply because in Lemma 3.5 the transformation is independent of 
the choice of S. To understand this more precisely, let us notice that the transformation in 
Lemma 3.5 does not depend directly on a, but on the choice (for a given Q sufficiently large) 
of n periodic vectors coj for which 

\a -Uj\< d{\a\TjQ)- 1 , |a| _1 < Tj < d|a| _1 ^ Q (Q). 

Now the point is that if y is sufficiently close to a, one can choose the same periodic vectors and 
consequently the same transformation: the first part of condition (3.15) implies (in particular) 
that 

\y - a\ < 2e < (Q^(Q))- 1 < d^T.Qy 1 

so that for all 1 < i < n, 

\y — Wj| < \y — a\ + \a — < 2d(\a\TjQ)~ l . 

Hence the same transformation can be used if we replace a by y, and the same estimates 
can be obtained (by imposing slightly stronger assumptions, as it is done in the first part of 
condition (3.15), to compensate the factor 2 in the last inequality). 

Now it will be easy to use Lemma 3.6 infinitely many times to prove Theorem 3.3. At the 
beginning, y can lie anywhere in B2 £ {a). After one step, if P = X v , then y has to belong to 
the image of the map (p defined above, and this image is a smaller ball, namely B e (—rj). After 
m steps, y has to belong to a ball of radius e m _i = 2 m ~ 1 e and so at the limit, y = a + j3 for 
some unique (3 6 C n with |/3| < 2e. 

Proof of Theorem 3.3. Let C djCe = 2 5 d 2 \a\~ 1 (2 d - 1) and K dt0l = 2C djCe . For any m G N, and 
for Q > max(l, ci|a| _1 ) to be chosen below, let us define the monotonic sequences 

e m = 2- m e, A m = 2 m A a (Q), Q m = A* (A m ). 

Let also e_i = 2e, sq = s and for m G N, define 

With these choices, since A m e m = AqEo for all m G N, the first part of the condition 

3A m e m < 1, C d ^a m l < 1, (3.16) 

is satisfied for any m G N provided 3Ao£o = 3A a (Q)e < 1, while the second part is always 
satisfied by definition of a m . Hence we choose Q = A* ((3e) _1 ), and the requirement that 
Q > max(l, d|a| _1 ) is ensured by the first part of our threshold (3.5) in the statement of 
Theorem 3.3. Now note that 

Qm^aiQm) = A a (Q m ) = A a (A* (A m )) < A m , 

hence the condition (3.16) implies 

3Q m ^a(Qm)e m < 1, C d , a Q-}o^ < 1. (3.17) 
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We claim that for any m G N, there exist an embedding (p m : B £m _ 1 (a) — > B2 £ {a) and a real 
analytic embedding <E> m : T™ m — > T" such that for all x m € B £m _ 1 (a), 

with the estimates 

m— 1 

i<& m -id| Sm ^-^n- 1 E^r 1 . i p -i^< £ ™- ( 3 - 18 ) 

Indeed, for m = 0, choosing (p° and <I> to be the identity, and Po = P, there is nothing to 
prove. If we assume that the statement holds true for some m £ N, then by (3.17) we can 
apply Lemma 3.6 to the resulting vector field and an embedding (p m +i '■ B £m {a) — > B £ml (a) 
and a real analytic embedding <5 m +i : T" m+1 — > T" m are constructed. It is then sufficient to let 
^m+i = ^ m G $m+i = $ m Q ^fand P m +i = P£- The estimate |P m+ i| Sm+1 < e m+l 

is obvious, and since SQ m ^ a {Qm)e m < 1, we have 

|$ m+1 - Id| Sm+1 < d 2 |a|- 1 * Q (Q m ) £m < S-^Vl- 1 ^- 1 

and the estimate for <3? m+1 follows. 
Now let us prove that 

lim e m = 0, lim s m > s/2. (3.19) 

m— >+oo m— >+oc 

The first assertion is obvious. For the second one, Q m = A* (A m ) = A* (2 m A Q (Q)) hence 

dy n . , /• + °° cfe 



(Q) 



xA*(x) 



and therefore, using the second part of our condition (3.5) in the statement of Theorem 3.3 
and the fact that Qo = Q, we obtain 

E < Q" 1 + (1H2)- 1 / < (3.20) 

and by the definition of cx m , this gives 



m>0 m>0 

This implies that 

m— >+oo 



lim s m = s - V \a m > s/2. 



m>0 



From the first part of (3.19), when m goes to infinity, x m converges to a and therefore 
ip m converges to a trivial map (p : {a} — > B 2e (a). From (3.18) and (3.19), P m converges 
to zero uniformly on every compact subsets of T™^ 2 , while <3? m converges to a embedding 
$ : T™y 2 — >■ T™, uniformly on every compact subsets of T™y 2 . Since the space of real-analytic 
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functions is closed for the topology of uniform convergence on compact subsets, <3? is real- 
analytic, and from (3.20), we obtain the estimate 



1$ 



(r+oo 
Q- 1 + (ln2)- 1 / 
JA a (Q 



dx 



,(Q) xA* a (x) J ' 

Finally, if j3 G C n is the unique vector such that <p(a) = a + (3, then |/3| < 2e and we have 

<&*{X + Xp) = X a . 

This was the statement to prove. □ 

A Bruno-Riissmann condition 

Recall that the set of Bruno-Riissmann vectors Bd was defined in §2.2, and in §3 we introduced 
another condition (A). The aim of this short appendix is to prove the following proposition. 

Lemma A.l. A vector a satisfies condition (A) if and only if it belongs to 

In the proof, we shall make use of integration by parts and change of variables formulas 
for Stieltjes integral. 

Proof First let us prove that there exists a continuous, non-decreasing and unbounded func- 
tion $ : [1, +oo[— > [1, +oo[ such that 

^a(Q)<HQ)<^a(Q + l), Q>1. (A.l) 

Recall from Proposition 2.2 that ^ a = is left-continuous and constant on each interval 
]Qi,Qi+i], l € N* and on [l,Qi] with Q\ > 1. For any / € N*, let us choose a point 
h €]Qi, Qi+i[Ci[Qi + i - l,Qi+i[ and define 



HQ) 



V a (Qi+i), Qe]Qi,b t ], 

*a(Qi+l) + {Q~ bl)(Ql + l - k)- 1 (* a (Ql +2 ) ~ , Q Ql+l}- 



If Qi > 1, we choose a point bo £ [l,Qi[n[Qi — l,Qi[ and define &o on [ljQi] similarly, 
otherwise if Q\ = 1 we simply define $o(l) = ^a(l), so that finally the function defined by 

lew* 

has all the wanted properties. The existence of such a function $ shows that in Corollary 2.5, 
the integral condition may be written equivalently in terms of *& a , or also in terms of this 
function $. 

Now let A$(Q) = Q$(Q) and Ag 1 denotes the functional inverse of A$. From (A.l), we 
deduce the following inequalities: 

A Q (Q) < A$(Q) < A a (Q + 1), Q>1. 

A*(x) - 1 < A-\x) < A*(x), x>A a (l). 
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From (A. 2) it follows that a satisfies condition (A) if and only if 



/ A -i, x < °°- ( A -3) 

Ja,,(i) sA, (x) 



»(i) ( x ) 

By a change of variables, letting Q = Ag 1 (x), (A. 3) is equivalent to 

dA^(Q) 



, QA $ (Q) <0 °- < A " 4 > 



Now for t = A $ 1 (l) and T > t, an integration by parts gives 

T-HnA^T) + j\-*H^(Q))dQ = [ (A.5) 
and as T _1 In A$(T) > 0, letting T goes to infinity in (A.5), condition (A. 4) implies 

/+oo 

Q- 2 ln(A${Q))dQ < oo. (A.6) 



Now, using the fact that A$ is increasing and assuming that (A.6) holds true, we also have 

r 1 lnA$(T)= / Q~ 2 \n{A^{T))dQ < / Q~ 2 ln(A$(Q))dQ 

and therefore letting T goes to infinity in (A.5), condition (A.6) implies condition (A.4). 
So (A.4) and (A.6) are in fact equivalent. Since In A$(Q) = InQ + ln$(Q), (A.6) is clearly 
equivalent to 

/+oo 
Q- 2 ln{$ (Q))dQ <oo. (A.7) 

Hence conditions (A), (A. 3), (A.4), (A.6), (A.7) and a <G are an equivalent, and this ends 
the proof. □ 



B Technical estimates 

In this second appendix, we derive technical estimates concerning the Lie series method for 
vector fields. These are well-known (see [Fas90] for instance, where this formalism was first 
used as far as we aware of), but for completeness we prove the estimates adapted to our need. 

Lemma B.l. Let V be a bounded real-analytic vector field on T™ ; < ? < s and r = ^l^lj 1 - 
Fort £ C such that \t\ < r, the map V 1 : T™_ f — > T" is a well-defined real-analytic embedding, 
and we have 

|V*-Id| s _ 5 < \V\„ \t\<T. 
Moreover, V 1 depends analytically on t, for \t\ < r. 
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Proof. This is a direct consequence of the existence theorem for analytic differential equations 
and the analytic dependence on the initial condition: on the domain T™_ ? , for \t\ < r, V t is 
well-defined, depends analytically on t and satisfies the equality 



V* = Id + f V o V u du. 
Jo 

The statement follows. □ 

Lemma B.2. Let X and V be two bounded real- analytic vector fields on T™, and < q < s. 
Then 

\[X, V]\ 3 - f <2r 1 \X\ s \V\ s . 

Proof. First consider a real-analytic function / defined on T", and let Lyf the Lie derivative 
of / along V, that is 

£vf = j t (f o V% =0 = F'(0), F(t) = foV t . 

Now for \t\ < t = s\V\~ l , by Lemma B.l, F{t) is a well-defined real-analytic function on 
T"_ CT and moreover the map F is analytic in t, hence by the classical Cauchy estimate 

\C v f\s-a = \F'(0)\ < t- 1 sup |/ o V%- a < r-Vl- = ^ini/U- 

\t\<T 

Similarly, we have 

\Cxf\s-a<^ l \X\ s \f\ s . 

Now, for 1 < i < n, if X % and V % are the components of X and V, then CxVi — £v x i are 
the components of [X, V], so each component of [X, V] is bounded, on the domain T™_ a , by 
2<j~ 1 |X| s |F| s and therefore 

\[X, V]\ S ^<2^ 1 \X\ S \V\ S 
which is the desired estimate. □ 

Lemma B.3. Let X and V be two bounded real-analytic vector fields on T", and < c < s. 
Assume that \V\ S < (ie)- 1 ^. Then for all \t\ < 1, 

\{v t yx\ s _ a <2\x\ s . 

Proof. From the general identity 

j t (v t yx = (v t y[x,v] 

we have the formal Lie series expansion 

(v'yx = Y,(^r 1 x n t n , x = X, X n = [X n -i, V], n > 1. 

nGN 

Let Si = s — in~ x q for 1 < i < n, so that so = s and s n = s — q. Using Lemma B.2, we have 
l-X'nls-e = \ X n\s n = \[ x n-i,V]\ Sn < 2nq 1 \X n -i\ Sn _ 1 \V\ Sn _ 1 
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and by induction 



\X n \ s ^ < (2n^ l ) n \X \ S() \V\l = {2n^ 1 ) n \X\ s \V\ n s . 



By assumption, \V\ S < (4e) 1 q so 

\X n \ s ^ < (2e)- n n n \X\ 

and therefore 



Kv'yxu^ < \x\ s Y,(nirH^)- n (n\t\r. 



n>0 

Now for any n £ N*, n\ > n n e~ n , hence (n!) _1 (2e) _n (n|i|) n < (2 _1 |i|) n and since |i| < 1, the 
above series is bounded by 2. This proves the lemma. □ 

The above estimate can be easily improved, but for t / 0, the constant 2 cannot be 
replaced by 1. 
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